Stripe Minions, Amazon Kiro, Claude Code compaction, and a Replit DB deletion. We synthesize multiple cases to extract the design principles required to operate AI coding agents in production, and organize them alongside CodeRabbit's 470‑repo statistics plus efforts from Google and GitHub.
Kiro autonomously deleted production, causing 13 hours of AWS downtime; Claude Code’s auto-compaction irreversibly erases context; sub-agents silently burn through usage. Three incident reports from the same week.
Stripe’s Minions agent generates 1,300+ PRs per week with zero human effort. Implementation details of the four components: Devbox, Blueprints, Toolshed, and a fork of goose.